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Description 

METHOD OF MOTION VECTOR 
DETERMINATION IN DIGITAL VIDEO 

COMPRESSION 

Background of Invention 
[0001] i. Field of the Invention 

[0002] The present invention relates to digital video, and more 

specifically, to digital video compression techniques using 
motion estimation. 

[0003] 2. Description of the Prior Art 

[0004] Digital video is a popular means of information communi- 
cation. Computer-based video conferencing and digital 
television are just two examples of applications of this 
technology. As a single digital image can contain hun- 
dreds of thousands of pixels and many sequential image 
frames are required to produce a quality video effect, 
compression schemes are required for efficiency in pro- 
cessing time and storage space. 



[0005] | n general, image compression is effected by determin- 
ingcorrelations among regions of pixels. When high com- 
pression is required due to limits in data transmission 
speeds for example, either video quality must be sacri- 
ficed or additional compression hardware and software 
must be provided. Where low amounts of compression are 
tolerable, video quality can be maintained at the cost of 
an increase in memory demand. Among this complex ar- 
rangement of trade-offs, any increase in compression that 
does not significantly lower video quality or increase the 
necessary hardware is undoubtedly beneficial. 

[0006] Referring to Fig.l, consider a typical digital video 10 dis- 
playing an image 12 with a current frame 14 and a previ- 
ous frame 16. The current frame 14 is made up of pixels 
20 of a video memory or display device such as a com- 
puter monitor or digital television set. A set of pixels 20 
can be grouped into a current block 18 for the purposes 
of video compression, which is typically facilitated by mo- 
tion estimation. Examples of state-of-the-art motion es- 
timation compression schemes are MPEG-1 and MPEG-2, 
although others are widely known and used. 

[0007] During motion estimation, temporal redundancies of 

video frames are used to minimize data repetition thereby 



providing compression. Pixel information of a block un- 
dergoing compression, such as the block 18, is compared 
with pixel information of the previous frame 16. Specifi- 
cally, the previous frame 16 is searched for a predictor re- 
gion 22 that contains a substantial amount of pixel data 
of the block 18. Once a suitable predictor region is found, 
it is identified by a motion vector, which indicates the po- 
sition of the predictor region 22 relative to the current 
block 18. That is, the data of the block 18 comprises a 
motion vector indicating the location of the predictor re- 
gion 22 in the previous frame 16, as well as any data rep- 
resenting variations in pixel information from the predic- 
tor region 22. Thus, the block 18 of the current frame 14 
can be defined in terms of its displacement from the pre- 
vious frame 16. 

[0008] There are numerous ways to search the previous frame, or 
reference frame, for a suitable predictor block. These 
range from slow pixel-based full search methods to sam- 
pling fast motion estimation methods. Fig. 2 illustrates the 
well-known three-step fast motion estimation method. A 
region 30 represents the area of the reference frame to be 
searched, in which nodes of the grid represent predeter- 
mined search regions. A search region may be of any size, 



from a single pixel to a large array of pixels, and is typi- 
cally the same size as the block to be compressed, which 
is also typically spatially coincident with the center region 
Rl. According to the first step of the three-step method, 
pixel information of the block to be compressed, the cur- 
rent block, is compared with the pixel information of the 
search regions R1-R9. Values of a cost function defining 
the correlation of pixel information are determined for 
each region R1-R9, and the region having the lowest cost 
function (region R4 in Fig. 2) is selected for the second 
step. In step two, search regions S2-S9 surrounding re- 
gion R4 are compared with the pixel information of the 
current block, and again a region having the lowest cost 
function (region S7 in this case) is selected for the third 
and final step. Finally, the third step compares the current 
block with search regions T2-T9, of which the region hav- 
ing the lowest cost function (region T2) is selected as the 
region that best matches current block. The result of the 
third step is location of the predictor region for the cur- 
rent block being compressed, and such result can be 
identified by a motion vector (for region 12 this would be 
a vector (1,3)) in the reference frame. 
[0009] Typically, the cost function is a sum of absolute differ- 



ences (SAD) or a sum of squared differences (SSD) as ex- 
pressed by the following functions: 
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[0010] where, 

[° 011 ] P is pixel data, such as a luminancevalue, of the current 

curr 

block being compressed;P ref is pixel data of a search re- 
gion in the reference frame (such as regions R1-R9, 
S2-S9, T2-T9 of Fig. 2); 

[° 012 ] i, j are horizontal and vertical indices of the pixels of the 
current block and search regions; 

[0013] n j m ar e the horizontal and vertical sizes of the current 
block and search regions; andu, v are horizontal and ver- 
tical offsets of the search regions of the reference frame 
with respect to the location current block in the current 
frame (for region R5, u = -4 and v = 0). 

[0014] Although not without its benefits, the three-step method 



can suffer from a local minimum deficiency. For instance, 
in the example of Fig. 2, although in the first step the re- 
gion R4 was found to best correlate with the current 
block, a region XI may actually provide the lowest overall 
cost function. In detail, given the search region 30, sup- 
pose the actual motion vector is defined by XI. Using the 
three-step search as an exemplary motion estimation 
method, during the first step, the three-step process may 
determine the candidate with the smallest cost function to 
be R4. Following the direction of R4, the three-step 
search may ultimately lead to an incorrect motion vector 
such as that defined by T2. This is because, once a region 
is selected in the first step, the result of the three-step 
search is limited to proximate regions. The three-step 
method provides no contingency for this effect, that is, 
after the region R4 is selected in the first step, the most 
suitable region XI cannot be arrived at during the second 
and third steps. This local minimum deficiency can be 

found in other state-of-the-art methods as well. 
Summary of Invention 

[0015] it is therefore a primary objective of the claimed invention 
to provide a method of determining a motion vector for a 
current block that considers motion vectors of a block 



proximate to the current block, a coincident block in a 
different frame, and a predefined search pattern to reduce 
the probability of a local minimum error. 
[0016] Briefly summarized, the claimed invention method corre- 
lates pixel information of a current block with: (a) pixel 
information of a reference frame indicated by a first mo- 
tion vector of a block proximate to the current block, (b) 
pixel information of the reference frame indicated by a 
second motion vector, the second motion vector being of 
a block of the reference frame spatially coincident with 
the current block, and (c) pixel information of predeter- 
mined regions of the reference frame. Corresponding cost 
functions of the motion vectors are determined before the 
method selects at least one of the first, second, and third 
motion vectors having the lowest cost function as a candi- 
date motion vector. The method then further correlates 
pixel information of the current block with pixel informa- 
tion of regionsoffset from the regions indicated by the 
candidate motion vectors to determine refined candidate 
motion vectors and corresponding refined cost functions, 
before finally, selecting the refined candidate motion vec- 
tor having the lowest refined cost function as the motion 
vector of the current block. 



[0017] According to the claimed invention, each of the first and 
second motion vectors can be discarded if the corre- 
sponding cost function is above a threshold level. 

[0018] it is an advantage of the claimed invention that the motion 
vector of the proximate block as well as those of a coinci- 
dent block and predefined regions are considered as can- 
didates for the current block, thereby improving the accu- 
racy of motion vector determination. 

[0019] it j S a further advantage of the claimed invention that a 
proximate block motion vector and a coincident block 
motion vector are discarded when the corresponding cost 
functions are above the threshold level, thereby speeding 
the process of motion vector determination when pixel in- 
formation of the proximate block correlates poorly with 
that of the current block. 

[0020] These and other objectives of the claimed invention will 

no doubt become obvious to those of ordinary skill in the 

art after reading the following detailed description of the 

preferred embodiment that is illustrated in the various 

figures and drawings. 
Brief Description of Drawings 

[0021] pig.l is a schematic diagram of frames of a digital video. 



[0022] pig. 2 is a schematic diagram of a three-step motion vec- 
torsearch method according to the prior art. 

[0023] pig. 3 is a block diagram of a video system for performing 
the present invention method. 

[0024] Fig. 4 is a schematic diagram of the present invention 
method. 

[0025] Fig. 5 is a flowchart of a first embodiment of the present 

invention method. 

[0026] Fig. 6 is a flowchart of a second embodiment of the 

present invention method. 
Detailed Description 

[0027] General: 

[0028] Fig. 3 illustrates a video system 40 capable of performing 
the present invention motion vector determination 
method. The video system comprises a random-access 
memory 44 and a processor 46 for executing a compres- 
sion algorithm 48 according to the present invention. The 
compression algorithm may be stored in the RAM 44 or in 
a similar memory dedicated to the processor. The video 
system 40 receives a raw video signal from a video source 
42, such as a digital video camera, at the RAM 44 though 
an interface (not shown). The processor 46 then executes 



the compression algorithm 48 on the raw video data 
stored in the RAM 44 and outputs compressed video data 
to a display or storage device 50 via an output interface 
(not shown). In practical applications, the video source 42 
may be a recorded or live program digitally broadcast to a 
digital television set comprising the RAM 44, processor 
46, and display 50. Likewise, the source 42 may be a 
computer-based digital camera, with the RAM 44 and 
processor 46 being part of the computer, the display de- 
vice 50 being a remote monitor to which compressed 
video is sent across a network. 
[0029] pig. 4 illustrates a schematic diagram of a preferred em- 
bodiment of the present invention method. Two frames of 
a digital video are shown, a current frame 60 and a refer- 
ence frame (reference image) 70. The reference frame 70 
is typically the immediately previous frame, but this is not 
a requirement. Pixel information of blocks of the current 
frame 60 to be compressed, and at any given time, a cur- 
rent block 62 is undergoing compression. The present in- 
vention considers three sets of motion vectors for com- 
pression of the pixel information of current block: motion 
vectors of nearby (proximate) blocks, a motion vector of a 
block at the same spatial location but in a different frame, 



and motion vectors determined by predefined locations 
relative to the location of the current block. Cost functions 
of these candidate motion vectors are then compared ac- 
cording to predetermined rules to determine two most 
suitable motion vectors for further correlation. Finally, af- 
ter further correlation utilizing any number of well-known 
methods, the motion having the lowest cost function value 
is selectedfrom the candidates as the motion vector of the 
current block. 
[0030] Proximate Blocks: 

[0031] Referring to Fig. 4, blocks 64, 66, 68 proximate to the cur- 
rent block 62 are selected. The specific proximate blocks 
can be selected according to sound design procedures, 
with proximate blocks already having motion vectors de- 
termined being efficient choices. Though three proximate 
blocks 64, 66, 68 are selected in the preferred embodi- 
ment, any number of proximate blocks can be used. Re- 
spective motion vectors Al, A2, A3 (shown as points in 
Fig. 4) of the proximate blocks 64, 66, 68 and the mean of 
these motion vectors A4 are then compared. Specifically, 
pixel information of the current block 62 is compared to 
pixel information of regions in the reference image 70 
spatially relative to the current block 62 as indicated by 



the motion vectors Al, A2, A3, A4. For instance, the mo- 
tion vector Al of the proximate region 64 is used to lo- 
cate a region in the reference image 70 relative to a posi- 
tion in the reference image 70 corresponding to the posi- 
tion of the current block 62 in the current frame 60. As an 
example, if the coordinates of the current block 62 in the 
current frame 60 are (64, 48) and the motion vector Al is 
(-3,-2), pixel information of the region at (61,48) in the 
reference image 70 would be correlated to pixel informa- 
tion of the current block 62. Such comparison of pixel in- 
formation is a cost function comparison and can be per- 
formed according to any well-known method such as the 
SAD or SSD methods previously described. The motion 
vector of the motion vectors Al, A2, A3, A4 having the 
lowest cost function value, i.e. best correlation, is then 
selected as a potential motion vector or point PI. Natu- 
rally, in other embodiments, more than one motion vector 
Al, A2, A3, A4 could be selected for consideration. 

[0032] Determination of the potential motion vector PI is illus- 
trated in step 100 of the flowchart of Fig. 5 and in step 
200 of the flowchart of Fig. 6. 

[0033] Co-located Block: 

[0034] a block 72 at the same relative location in the reference 



frame as the current block 62 in the current frame 60 is 
selected. The motion vector of this co-located (or coinci- 
dent) block 72 is designated as potential motion vector 
P2. Of course, the co-located block 72 need not be in the 
reference frame 70 provided it is in a frame other than the 
current frame 60. Additionally, the co-located block 70 
should preferably already have its motion vectordeter- 
mined to prevent redundant calculation. 

[0035] Determination of the potential motion vector P2 is illus- 
trated in step 100 of the flowchart of Fig. 5 and in step 
200 of the flowchart of Fig. 6. 

[0036] Predetermined Regions: 

[0037] Regions indicated by predetermined points, such as the 

regions R1-R9 of Fig. 2, are selected. In Fig. 2, these points 
are shown as broken-line circles R, and although they are 
illustrated in the current frame 60 for convenience, they 
actually indicate regions in the reference frame 70. The 
motion vectors corresponding to these points R are used 
to locate regions in the reference image 70 that are then 
correlated with the current block on a cost function basis. 
Motion vectors of two of the regions R having the best 
correlations, i.e. lowest cost function values, are desig- 
nated as potential motion vectors P3, P4. Each of the po- 



tential motion vectors P3, P4 is selected in much the same 
way as the resulting point of the first step of the well- 
known three-step method is determined. In the preferred 
embodiment, two potential motion vectors P3, P4 are 
used, however, more or fewer are also acceptable. 

[0038] Determination of the potential motion vectors P3, P4 is il- 
lustrated in step 100 of the flowchart of Fig. 5 and in step 
200 of the flowchart of Fig. 6. 

[0039] Candidate Motion Vectors: 

[0040] | n the present invention, once the potential motion vec- 
tors PI, P2, P3, P4 are determined, they are then reduced 
to two candidate motion vectors CI, C2. This can be done 
according to a suitable decision rule, two of which are de- 
scribed as follows. 

[0041] According to the first decision rule, the candidate motion 
vectors CI, C2 are selected as the two potential motion 
vectors PI, P2, P3, P4 having the two lowest calculated 
cost functions. As these cost functions have already been 
determined when determining the potential motion vec- 
tors PI, P2, P3, P4, calculations need not be repeated. The 
first decision rule is illustrated in step 102 of Fig. 5. 

[0042] According to the second decision rule, cost functions of 
the potential motion vectors PI, P2 resulting from the 



proximate blocks and the co-located block are compared 
with a suitable threshold value. The potential motion vec- 
tors PI, P2 are independently discarded if the respective 
cost functions are above the threshold value. Then, cost 
function values of the non-discarded, if any, potential 
motion vectors PI, P2 and the potential motion vectors 
P3, P4 from the predetermined regions are compared, 
with the two potential motion vectors having the two low- 
est cost functions being selected as the candidate motion 
vectors CI, C2. The second decision rule is illustrated in 
steps 202 and 204 of Fig. 6. Naturally, only one candidate 
motion vector need be determined, however, two provide 
additional insurance against a local minimum. 

[0043] other decision rules are of course possible, with the spe- 
cific parameters of such being determined based on the 
service requirements of a device incorporating the present 
invention method. 

[0044] Further Correlation and Selection: 

[0045] After the candidate motion vectors CI, C2 are determined, 
they are used as starting vectors for any well-known 
search method. For example, such refinement of the can- 
didate motion vectors CI, C2 can be according to the re- 
maining two steps of the three-step method previously 



described. After performance of a refining search method, 
the cost function values of the resulting motion vectors 
are then compared, with the motion vector having the 
lowest cost function being selected as the motion vector 
of the current block. This process is illustrated in Fig. 5 
and Fig. 6 as steps 104, 106 and steps 206, 208 respec- 
tively. 
[0046] Summary: 

[0047] | n practical application, the above-described method of 
determining a motion vector of a current block can be 
performed by the processor 46 of Fig. 3. Instructions exe- 
cutable by the processor 46 that define the method can be 
stored as the compression algorithm 48 or separately in 
the RAM 44. The present invention method can be real- 
izedwith hardware and software, the advantages of each 
dependant on the specific application. 

[0048] | n contrast to the prior art, the present inventionconsiders 
motion vectors of proximate blocks, that of a co-located 
block, and those calculated by predefined search regions 
to determine a most suitable motion vector for a current 
block. That is, thepresent invention methodextends a mo- 
tion vector search area to highly probable and already de- 
termined locations outside the predetermined regions. In 



this way, the present invention reduces the chances of the 
local minimum problem occurring during digital video 
compression. 

[0049] Those skilled in the art will readily observe that numerous 
modifications and alterations of the device may be made 
while retaining the teachings of the invention. Accord- 
ingly, the above disclosure should be construed as limited 
only by the metes and bounds of the appended claims. 



